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ENTERED 



1600 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/758 , 017B 



DATE: 05/30/2003 
TIME: 13:47:41 



Input Set : A:\EP.txt 

Output Set: N:\CRF4\05302003\l758017B.raw 

3 <110> APPLICANT: Lanes, Olav 

4 Willasen, Nils Peder 

5 Guddal, Per Henrik 

6 Gjellesvik, Dag Rune 

8 <120> TITLE OF INVENTION: COD URACIL-DNA GLYCOSYLASE, 



GENE CODING THEREFORE, 



RECOMBINANT 
9 
10 
11 
13 
15 
16 
18 
19 
21 
22 
24 
26 
28 
29 
30 
31 
■ 33 
34 
35 
36 



DNA 



CONTAINING SAID GENE OR OPERATIVE PARTS THEREOF, A METHOD FOR PREPARING SAID 
PROTEIN AND THE USE OF SAID PROTEIN OR SAID OPERATIVE PARTS THEREOF IN 
MONITORING OR CONTROLLING PCR 
<130> FILE REFERENCE: U 013209-3 
<140> CURRENT APPLICATION NUMBER: 09/758, 017B 
<141> CURRENT FILING DATE: 2001-01-10 
<150> 



<151> 
<150> 
<151> 



PRIOR APPLICATION NUMBER: 2000 5428 
PRIOR FILING DATE: 2000-10-27 
PRIOR APPLICATION NUMBER: 2000 0163 
PRIOR FILING DATE: 2000-01-12 
<160> NUMBER OF SEQ ID NOS : 22 
<170> SOFTWARE: Patentln version 3.1 
SEQ ID NO: 1 
LENGTH: 1283 
TYPE: DNA 

ORGANISM: GADUS MORHUA 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (18) . . (920) 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<223> 
38 <400> 
39 



OTHER INFORMATION: 
W — > 38 <400> 1 

gacatccgct tgcaaat atg ttg ttc aag tta ggg tta tgc caa aga tgc 
4 0 Met Leu Phe Lys Leu Gly Leu Cys Gin Arg Cys 

41 15 10 

43 ata tea tea aat egg gty tta cca ggt tta eta att ccc caa act tta 
W — > 44 lie Ser Ser Asn Arg Xaa Leu Pro Gly Leu Leu lie Pro Gin Thr Leu 
45 15 20 25 

47 tgt ttt tct aaa tta atg aag ata acg ccg aag aaa ctg agg tec tea 
4 8 Cys Phe Ser Lys Leu Met Lys lie Thr Pro Lys Lys Leu Arg Ser Ser 
49 30 35 40 

51 aat gtg gaa caa aag acg tea teg cca cag ctt tea gtg gag cag ctg 

52 Asn Val Glu Gin Lys Thr Ser Ser Pro Gin Leu Ser Val Glu Gin Leu 

53 45 50 55 

55 gaa aga atg gee aaa aat aag aaa gca gcg ctt gac aag att aga gca 

56 Glu Arg Met Ala Lys Asn Lys Lys Ala Ala Leu Asp Lys lie Arg Ala 

57 60 65 70 75 

59 aaa gca acg cct gca ggt ttc gga gag act tgg aga aga gag ctg get 

60 Lys Ala Thr Pro Ala Gly Phe Gly Glu Thr Trp Arg Arg Glu Leu Ala 



50 



98 



146 



194 



242 



290 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: 03/09/758,0176 



DATE: 05/30/2003 
TIME: 13:47:41 



Input Set : A:\EP.txt 

Output Set: N:\CRF4\05302003\l758017B.raw 



61 80 85 90 

63 gca gag ttt gaa aag cca tac ttc aaa caa ttg atg tec ttt gta get 338 

64 Ala Glu Phe Glu Lys Pro Tyr Phe Lys Gin Leu Met Ser Phe Val Ala 

65 95 100 105 

67 gat gag agg age cgt cac acc gtc tac. cca ccg get gat caa gtg tac 386 

68 Asp. Glu Arg Ser Arg His Thr Val Tyr Pro Pro Ala Asp Gin Val Tyr 

69 110 115 120 

71 agt tgg aca gag atg tgt gac att caa gat gtg aaa gta gtg att eta 434 

72 Ser Trp Thr Glu Met Cys Asp lie Gin Asp Val Lys Val Val lie Leu 

73 125 130 135 

75 ggc cag gac cct tac cac ggt ccc aac caa gca cat gga etc tgt ttc 482 
7 6 Gly Gin Asp Pro Tyr His Gly Pro Asn Gin Ala His Gly Leu Cys Phe 
77 140 145 150 155 

79 agt gtg caa aag cca gtt ccc cct ccc ccc agt etc gtg aac ata tac 530 

80 Ser Val Gin Lys Pro Val Pro Pro Pro Pro Ser Leu Val Asn lie Tyr 

81 160 165 170 

83 aaa gaa ttg tgt acc gac att gat ggc ttc aag cat cct gga cat gga 578 

84 Lys Glu Leu Cys Thr Asp lie Asp Gly Phe Lys His Pro Gly His Gly 

85 175 180 185 

87 gat eta age gga tgg gca aaa caa ggg gtg ctg ctg ctt aac gcg gtg 62 6 

88 Asp Leu Ser Gly Trp Ala Lys Gin Gly Val Leu Leu Leu Asn Ala Val 

89 190 195 200 

91 ctg acc gtg egg gec cat cag gec aac tec cac aag gac aga ggc tgg 674 

92 Leu Thr Val Arg Ala His Gin Ala Asn Ser His Lys Asp Arg Gly Trp 

93 205 210 215 

95 gag acc ttc acc gac get gtg ate aag tgg ctg age gtc aac egg gaa 722 

96 Glu Thr Phe Thr Asp Ala Val lie Lys Trp Leu Ser Val Asn Arg Glu 

97 220 225 230 235 

99 gga gtg gtt ttc ctg ttg tgg ggc tea tac gee cat aag aag gga gcg 770 

100 Gly Val Val Phe Leu Leu Trp Gly Ser Tyr Ala His Lys Lys Gly Ala 

101 240 245 250 

103 acc ate gac agg aaa cgt cac cat gtc ttg caa get gtt cat cca tct 818 

104 Thr lie Asp Arg Lys Arg His His Val Leu Gin Ala Val His Pro Ser 

105 255 260 265 

107 cct ttg tct get cat cgt ggg ttc ctt ggt tgt aag cac ttc tec aag 866 

108 Pro Leu Ser Ala His Arg Gly Phe Leu Gly Cys Lys His Phe Ser Lys 

109 270 275 280 

111 get aac ggg ctg ctg aaa eta tct ggg acg gag cct ata aac tgg aga 914 

112 Ala Asn Gly Leu Leu Lys Leu Ser Gly Thr Glu Pro lie Asn Trp Arg 

113 285 290 295 

115 gca etc taactcttta tgetgectta tactgttaac tgttttaaga tgaacatcac 970 

116 Ala Leu 

117 300 

119 actatatttt ctacagcttt tccaagttca aaccaatcta "taagctttca tttgtctttt 1030 
121 ggaatgatgc tgcttttggt eggttttaga tacttaaaac actttaccac tetgecatgt 1090 
123 tgactcatgt tcagtcaata taactttcac aacttgaaca aaaatgttat tttataattg 1150 
125 attatattct gtacattaaa gattgttttt ttcccaggct gtttcatagg tactaggata 1210 
127 ttaaactgtt attaacctat tttccatgat gtcaactget taagttttta tgcagaaata 1270 
129 aattatatat tta 1283 
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132 <210> SEQ ID NO: 2 

133 <211> LENGTH: 301 

134 <212> TYPE: PRT 

135 <213> ORGANISM: GADUS MORHUA 

137 <220> FEATURE : 

138 <221> NAME/KEY: misc_f eature 

139 <222> LOCATION: (17).. (17) 

14 0 <223> OTHER INFORMATION: The 'Xaa* at location 17 stands for Val. 

142 <400> SEQUENCE: 2 



14 4 
X 4 H 


Mot- 
net 


Leu 




Lys 


Leu 




T .01 1 
IjC u 


^ y 0 


OX 11 


Arg 


uy 0 


I 1p 

X X c 


Ser 


Ser 


Asn 


Arg 


1 4 R 


i 
x 








r 










10 










15 




1 4ft 

149 


Aao 




Pro 


Gly 


Ii6U 


Leu 


He 


Pro 


Gin 


Thr 


Leu 


Cys 


Phe 


Ser 


Lys 


Leu 


1 / Q 

X f± J 








j£ yj 










25 










30 






1 R9 


Mot* 
trie L 


Lys 


11c 


Thr 


Prn 

IT X U 


Xjy s 


Xjy o 


be u 


Arg 


Ser 


Ser 


Asn 


Val 


Glu 


Gin 


Lys 


1 R^ 


























45 








1 Rfi 

1JO 


Th r 


X 


C q 

Ocl 


IT d-\J 


Gin 


Leu 


Ser 


Val 


Glu 


Gin 


Leu 


Glu 


Arg 


Met 


Ala 


Lys 


1 R7 
ID / 
























60 










1DU 


7\ o 'O 


Lys 


Lys 


7\ "1 -a 


rild 


Leu 


nop 


Lys 


X X c5 


rtx y 


ai ^ 


Xjy 0 


Al ^ 

JTiX O 


Thr 


Pro 


Ala 


1 £1 
X D X 


0 O 










7 n 




















80 


1 £4 




IT llti 


oiy 


OX LI 


Thr 


ixp 


Arg 


Arg 


Glu 


Leu 


Ala 


Ala 


Glu 


Phe 


Glu 


Lys 


IDj 










O J 




















95 




1DO 


JrX O 


lyr 




Lys 




Xitr Li 


1 it: L. 


0 c j_ 


Phe 


Val 


Ala 


Asp 


Glu 


Arg 


Ser 


Arg 


1D3 








i on 

1UU 










1 os 










110 






17? 


His 


Thr 


Val 


i yx 


Pro 


Pro 


Ala 


Asp 


Gin 


Val 


Tvr 

1 y x 


Ser 


Tro 


Thr 


Glu 


Met 


1 IT 

X I -J 






IIS 
11J 










1 90 

-L j£. U 










125 








1 76 




nap 


x x c; 


Gln 


Asp 


Val 


Lys 


Val 


Val 


He 


Leu 


Gly 


Gin 


Asp 


Pro 


Tvr 


17 7 
Iff 




i 

X j u 










1 

1 jj 










14 0 

X *1 VJ 










180 


His 


Gly 


Pro 


Asn 


Gin 


Ala 


His 


Gly 


Leu 


Cys 


Phe 


Ser 


Val 


Gin 


Lys 


Pro 


181 


145 










150 










155 










160 


184 


Val 


Pro 


Pro 


Pro 


Pro 


Ser 


Leu 


Val 


Asn 


He 


Tyr 


Lys 


Glu 


Leu 


Cys 


Thr 


185 










165 










170 










175 




188 


Asp 


He 


Asp 


Gly 


Phe 


Lys 


His 


Pro 


Gly 


His 


Gly 


Asp 


Leu 


Ser 


Gly 


Trp 


189 








180 










185 










190 






192 


Ala 


Lys 


Gin 


Gly 


Val 


Leu 


Leu 


Leu 


Asn 


Ala 


Val 


Leu 


Thr 


Val 


Arg 


Ala 


193 






195 










200 










205 








196 


His 


Gin 


Ala 


Asn 


Ser 


His 


Lys 


Asp 


Arg 


Gly 


Trp 


Glu 


Thr 


Phe 


Thr 


Asp 


197 




210 










215 










220 










200 


Ala 


Val 


He 


Lys 


Trp 


Leu 


Ser 


Val 


Asn 


Arg 


Glu 


Gly 


Val 


Val 


Phe 


Leu 


201 


225 










230 










235 










240 


204 


Leu 


Trp 


Gly 


Ser 


Tyr 


Ala 


His 


Lys 


Lys 


Gly 


Ala 


Thr 


He 


Asp 


Arg 


Lys 


205 










245 










250 










255 




208 


Arg 


His 


His 


Val 


Leu 


Gin 


Ala 


Val 


His 


Pro 


Ser 


Pro 


Leu 


Ser 


Ala 


His 


209 








260 










265 










270 






212 


Arg 


Gly 


Phe 


Leu 


Gly 


Cys 


Lys 


His 


Phe 


Ser 


Lys 


Ala 


Asn 


Gly 


Leu 


Leu 


213 






275 










280 










285 








216 


Lys 


Leu 


Ser 


Gly 


Thr 


Glu 


Pro 


He 


Asn 


Trp 


Arg 


Ala 


Leu 








217 




290 










295 










300 











220 <210> SEQ ID NO: 3 

221 <211> LENGTH: 1355 
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RAW SEQUENCE LISTING DATE : 05/30/2003 

PATENT APPLICATION: US/09/758 , 017B TIME: 13:47:41 

Input Set : A:\EP.txt 

Output Set: N:\CRF4\05302003\l758017B.raw 

222 <212> TYPE: DNA 

223 <213> ORGANISM: GADUS MORHUA 

225 <220> FEATURE: 

226 <221> NAME/KEY: CDS 

227 <222> LOCATION: (90).. (992) 

228 <223> OTHER INFORMATION: 
W— > 230 <400> 3 

231 gatggtttag gaggatagta ctttgacact ggttagcgaa ggggaaaacg gagttattgt 60 

233 gcatatcgtt ttagccctac gtttaaaaa atg att ggt caa cag cat ate aac 113 

234 Met lie Gly Gin Gin His lie Asn 

235 1 5 

237 tct ttc ttc tea cca gtt tea aaa aag aga gtt tea aag gaa tta ggt 161 

238 Ser Phe Phe Ser Pro Val Ser Lys Lys Arg Val Ser Lys Glu Leu Gly 

239 10 15 20 

241 aaa ace gaa aag cat gee gaa gaa gtt cag ata acg ccg aag aaa ctg 209 

242 Lys Thr Glu Lys His Ala Glu Glu Val Gin lie Thr Pro Lys Lys Leu 

243 25 30 35 40 

245 agg tec tea aat gtg gaa caa aag acg tea teg cca cag ctt tea gtg 257 

24 6 Arg Ser Ser Asn Val Glu Gin Lys Thr Ser Ser Pro Gin Leu Ser Val 

247 45 50 55 

24 9 gag cag ctg gaa aga atg gee aaa aat aag aaa gca gcg ctt gac aag 305 

250 Glu Gin Leu Glu Arg Met Ala Lys Asn Lys Lys Ala Ala Leu Asp Lys 

251 60 65 70 

253 att aga gca aaa gca acg cct gca ggt ttc gga gag act tgg aga aga 353 

254 He Arg Ala Lys Ala Thr Pro Ala Gly Phe Gly Glu Thr Trp Arg Arg 

255 75 80 85 

257 gag ctg get gca gag ttt gaa aag cca tac ttc aaa caa ttg atg tec 

258 Glu Leu Ala Ala Glu Phe Glu Lys Pro Tyr Phe Lys Gin Leu Met Ser 

259 90 95 100 

261 ttt gta get gat gag agg age cgt cac ace gtc tac cca ccg get gat 449 

262 Phe Val Ala Asp Glu Arg Ser Arg His Thr Val Tyr Pro Pro Ala Asp 

263 105 110 115 120 

265 caa gtg tac agt tgg aca gag atg tgt gac att caa gat gtg aaa gta 497 

266 Gin Val Tyr Ser Trp Thr Glu Met Cys Asp He Gin Asp Val Lys Val 

267 125 130 135 

269 gtg att eta ggc cag gac cct tac cac ggt ccc aac caa gca cat gga 545 

270 Val He Leu Gly Gin Asp Pro Tyr His Gly Pro Asn Gin Ala His Gly 

271 140 145 150 

273 etc tgt ttc agt gtg caa aag cca gtt ccc cct ccc ccc agt etc gtg 593 

274 Leu Cys Phe Ser Val Gin Lys Pro Val Pro Pro Pro Pro Ser Leu Val 

275 155 160 165 

277 aac ata tac aaa gaa ttg tgt ace gac att gat ggc ttc aag cat cct 641 

278 Asn He Tyr Lys Glu Leu Cys Thr Asp He Asp Gly Phe Lys His Pro 

279 170 175 180 

281 gga cat gga gat eta age gga tgg gca aaa caa ggg gtg ctg ctg ctt 689 

282 Gly His Gly Asp Leu Ser Gly Trp Ala Lys Gin Gly Val Leu Leu Leu 

283 185 190 195 200 

285 aac gcg gtg ctg ace gtg egg gee cat cag gec aac tec cac aag gac 737 

286 Asn Ala Val Leu Thr Val Arg Ala His Gin Ala Asn Ser His Lys Asp 



401 
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PATENT APPLICATION: US/09/758 , 017B TIME: 13:47:41 
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287 










205 










210 










215 






289 


aga 


ggc 


tgg 


qaq 


acc 


ttc 


acc 


gac 


get 


gtg 


ate 


aag 


tgg 


ctg 


age 


gtc 


785 


290 


Arg 


Gly 


Trp 


Glu 


Thr 


Phe 


Thr 


Asp 


Ala 


Val 


lie 


Lys 


Trp 


Leu 


Ser 


Val 




291 








220 










225 










230 








293 


aac 


egg 


gaa 


aaa 


Qtq 


gtt 


ttc 


ctg 


ttg 


tgg 


ggc 


tea 


tac 


gee 


cat 


aag 


833 


294 


Asn 


Arg 


Glu 


Gly 


Val 


Val 


Phe 


Leu 


Leu 


Trp 


Gly 


Ser 


Tyr 


Ala 


His 


Lys 




295 






235 










240 










245 










297 


aag 


gga 


gcg 


acc 


ate 


gac 


agg 


aaa 


cgt 


cac 


cat 


gtc 


ttg 


caa 


get 


gtt 


881 


298 


Lys 


Gly Ala 


Thr 


He 


Asp 


Arg 


Lys 


Arg 


His 


His 


Val 


Leu 


Gin 


Ala 


Val 




299 




250 










255 










260 












301 


cat 


cca 


tct 


cct 


ttg 


tct 


get 


cat 


cgt 


ggg 


ttc 


ctt 


ggt 


tgt 


aag 


cac 


929 


302 


His 


Pro 


Ser 


Pro 


Leu 


Ser 


Ala 


His 


Arg 


Gly 


Phe 


Leu 


Gly 


Cys 


Lys 


His 




303 


265 










270 










275 










280 




305 


ttc 


tec 


aag 


get 


aac 


ggg 


ctg 


ctg 


aaa 


eta 


tct 


ggg 


acg 


gag 


cct 


ata 


977 


306 


Phe 


Ser 


Lys 


Ala 


Asn 


Gly 


Leu 


Leu 


Lys 


Leu 


Ser 


Gly 


Thr 


Glu 


Pro 


He 




307 








285 










290 










295 






309 


aac 


tgg 


aga 


gca 


etc 


taactcttta tgetgectta tactgttaac tgttttaaga 


1032 


310 


Asn 


Trp 


Arg 


Ala 


Leu 


























311 








300 




























J X J 


tgaacatcac actatatttt ctacagcttt tccaagttca 


aaccaatcta taagctttca 


1092 


315 


tttgtctttt ggaatgatgc tgcttttggt cggttttaga 


tacttaaaac actttaccac 


1152 


317 


tctgccatgt tgactcatgt tcagtcaata taactttcac 


aacttgaaca aaaatgttat 


1212 


319 


tttataattg attatattct gtacattaaa gattgttttt 


ttcccaggct gtt teat agg 


1272 


321 


tactaggata ttaaactgtt attaacctat tttccatgat 


gtcaactget taagttttta 


1332 


323 


tgcagaaata aattatatat tta 




















1355 


326 


<210> SEQ ID NO: 


: 4 


























327 


<211> LENGTH: 301 


























328 


<212> TYPE: 


PRT 




























329 


<213> ORGANISM: 


GADUS MORHUA 




















331 


<400> SEQUENCE: 


4 


























333 


Met 


He 


Gly 


Gin 


Gin 


His 


He 


Asn 


Ser 


Phe 


Phe 


Ser 


Pro 


Val 


Ser 


Lys 




334 


1 








5 










10 










15 






337 


Lys 


Arg 


Val 


Ser 


Lys 


Glu 


Leu 


Gly 


Lys 


Thr 


Glu 


Lys 


His 


Ala 


Glu 


Glu 




338 








20 










25 










30 








341 


Val 


Gin 


He 


Thr 


Pro 


Lys 


Lys 


Leu 


Arg 


Ser 


Ser 


Asn 


Val 


Glu 


Gin 


Lys 




342 






35 










40 










45 










345 


Thr 


Ser 


Ser 


Pro 


Gin 


Leu 


Ser 


Val 


Glu 


Gin 


Leu 


Glu 


Arg 


Met 


Ala 


Lys 




346 




50 










55 










60 












349 


Asn 


Lys 


Lys 


Ala 


Ala 


Leu 


Asp 


Lys 


He 


Arg 


Ala 


Lys 


Ala 


Thr 


Pro 


Ala 




350 


65 










70 










75 










80 




353 


Gly 


Phe 


Gly 


Glu 


Thr 


Trp 


Arg 


Arg 


Glu 


Leu 


Ala 


Ala 


Glu 


Phe 


Glu 


Lys 




354 










85 










90 










95 






357 


Pro 


Tyr 


Phe 


Lys 


Gin 


Leu 


Met 


Ser 


Phe 


Val 


Ala 


Asp 


Glu 


Arg 


Ser 


Arg 




358 








100 










105 










110 








361 


His 


Thr 


Val 


Tyr 


Pro 


Pro 


Ala 


Asp 


Gin 


Val 


Tyr 


Ser 


Trp 


Thr 


Glu 


Met 




362 






115 










120 










125 










365 


Cys 


Asp 


He 


Gin 


Asp 


Val 


Lys 


Val 


Val 


lie 


Leu 


Gly 


Gin 


Asp 


Pro 


Tyr 




366 




130 










135 










140 












369 


His 


Gly 


Pro 


Asn 


Gin 


Ala 


His 


Gly 


Leu 


Cys 


Phe 


Ser 


Val 


Gin 


Lys 


Pro 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 05/30/2003 

PATENT APPLICATION: US/09/758 , 017B TIME: 13:47:42 



Input Set : A:\EP.txt 

Output Set: N:\CRF4\05302003\l758017B.raw 
g^ Please Note: 

I Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
■Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
7 to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:l; Xaa Pos . 17 
Seq#:2;. Xaa Pos. 17 

Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 



Seq#:14,21,22 
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VERIFICATION SUMMARY DATE: 05/30/2003 

PATENT APPLICATION: US/09/758 , 017B TIME: 13:47:42 

Input Set : A:\EP.txt 

Output Set: N:\CRF4\05302003\l758017B.raw 

L:38 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 1 , Line# : 36 

L:44 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos . : 98 

L:148 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:16 

L:230 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 3, Linet : 228 
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